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The key structural and signaling roles 
of collagen in the extracellular matrix 
(ECM) make it an attractive biomaterial 
for tissue engineering, but there are lim- 
itations in the standardization and purity 
of natural collagen sources currently avail- 
able for such applications (Ruggiero and 
Koch, 2008; Werkmeister and Ramshaw, 
2012). Significant effort has been made 
to produce human collagen in recombi- 
nant systems, such as yeast, insect cells and 
plants (Ruggiero et al, 2000; Myllyharju, 
2009). However, the requirement for post- 
translational proline hydroxylation has 
proven to be a significant obstacle in 
achieving large scale production. Recent 
findings of collagen-like proteins in bacte- 
ria suggest these may represent alternative 
biosynthetic collagen materials which may 
complement current sources. 

Over the past 10 years, collagen-like 
proteins have been identified from numer- 
ous bacterial genomes database based 
on the signature (Gly-Xaa-Yaa) n repeat- 
ing amino acid sequence characteristic 
of the collagen triple-helix (Rasmussen 
et al, 2003; Yu et al, 2014). Some of 
these collagen-like molecules may func- 
tion as virulence factors by bacteria to 
evade the immune system of higher ani- 
mals or to interact with surface recep- 
tors or with other ECM molecules nec- 
essary to promote host cell invasion 
(Humtsoe et al., 2005). More than 100 
putative collagen-like proteins have been 
identified in bacterial genomes, of which 
eight have been recombinantly expressed 
in Escherichia coli (see Yu et al, 2014 
for review). All eight expressed bacterial 



collagens were shown to form stable 
triple -helices with T m ~35-39°C. E. coli 
and most bacteria lack prolyl hydroxy- 
lase, so this high stability is attained in 
the absence of hydroxyproline (Hyp), a 
post-translationally modified amino acid 
known to be critical to the thermal stabil- 
ity of mammalian collagens. Initial interest 
in bacterial collagen-like proteins focused 
on their roles in pathogenesis. However, 
recent work has focused on one specific 
bacterial collagen protein, designated Scl2, 
to demonstrate the utility of recombinant 
bacterial collagen as a tool for defining 
collagen sequence/structure/function rela- 
tionships and for establishing a class of 
novel collagen-based biomaterials. 

The gram positive bacterium 
Streptococcus pyogenes contains two 
collagen-like proteins, Sell and Scl2, which 
have been well characterized in terms 
of structure and functional properties 
(Lukomski et al, 2001; Xu et al., 2002; 
Mohs et al, 2007; Caswell et al, 2008). 
The Scl2 protein includes an N-terminal 
globular trimerization domain adjacent 
to a (Gly-Xaa-Yaa)79 core collagen-like 
domain. It has been possible to generate 
constructs in a recombinant E. coli system 
with various sequence modifications of 
Scl2 and to establish large scale production 
methods. Based on recent progress, we 
suggest that the Scl2 recombinant bacterial 
collagen system has advantages compared 
to recombinant human collagen strategies 
for large scale production and biomedical 
applications, and may serve as a proto- 
type for engineering novel collagen-based 
biomaterials. 



STABLE TRIPLE HELICAL PROTEIN 
WITHOUT HYDROXYPROLINE 

The recombinant Scl2 protein and its 
modified variants are able to form a 
triple helix with stability similar to that 
of mammalian collagens (T m ~37°C) even 
though there is no Pro hydroxylation. 
In animal collagens, Pro residues in 
the Y positions of the repeating Gly-X- 
Y sequence are hydroxylated to Hyp, and 
this hydroxylation is required to stabi- 
lize the triple-helix: T m of hydroxylated 
collagen = 37°C, while T m of unhy- 
droxylated collagen = 26°C (Berg and 
Prockop, 1973; Jimenez et al, 1973). The 
bacterial collagen appears to compensate 
for the absence of Hyp with electro- 
static interactions (Mohs et al., 2007). In 
yeast, plants, and other expression sys- 
tems for human collagens, the genes for 
the alpha and beta subunits of human 
prolyl hydroxylase (P4H) must be intro- 
duced to form stable collagen molecules 
(Ruggiero et al, 2000; Myllyharju, 2009; 
Xu et al, 2011; Shoseyov et al, 2014). 
However, P4H activity and hydroxylation 
levels are highly dependent on expression 
conditions, such as the gene copy ratio of 
collagen to hydroxylase, concentration of 
cofactors, induction time, and sequence 
(Chan et al., 2012), so generating opti- 
mal hydroxylation to model native human 
collagen has been challenging in systems 
with inserted P4H genes and even in 
mammalian cell expression systems. The 
bacterial collagen-like proteins are highly 
compatible with expression in E. coli, and 
the lack of requirement for any post- 
translational modifications presents the 
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potential for producing large quantities of 
recombinant stable triple -helical proteins 
for biomaterials applications. 

MIX AND MATCH FUNCTIONAL 
MODULAR UNITS 

In addition to production advantages, the 
ability to express the collagen-like pro- 
tein in E. coli allows easy manipulation 
of the sequence to enhance biomaterial 
properties. Insertion of known human col- 
lagen binding sites within Scl2 presents 
the possibility of designing collagen-like 
materials with defined biological proper- 
ties. The native human collagen sequence 
contains more than 40 binding sites which 
interact with other biologically functional 
molecules, including cell receptors, other 
ECM proteins and collagenases (MMPs) 
(Kadler, 1994). Examination of the Scl2 
recombinant protein suggests that this 
particular bacterial collagen-like protein 
lacks any known biologically active sites, 
making it a convenient blank slate for 
the introduction of human collagen lig- 
and binding sites (Caswell et al, 2008; 
Cosgriff-Hernandez et al, 2010). Various 
human-derived collagen short amino acid 
sequences (2-6 triplets) with identified 
bioactive sites have been inserted into 
the Scl2 collagen-like domain, including 
sites for binding integrin a2fH (Seo et al, 
2010), fibronectin (An et al, 2014), hep- 
arin (Peng et al., 2013), and MMPs (Yu 
et al., 2012). The inserted sequences have 
conferred the expected biological activi- 
ties on the Scl2 protein based on in vitro 
binding, cleavage, and cell culture assays. 
The Scl2 sequence will apparently fold 
the short human collagen insert into a 
triple helix, while the correctly folded 
human collagen sequence imparts the 
corresponding biological function to the 
chimeric protein. The introduction of 
the integrin and fibronectin binding sites 
promoted the growth of different types 
of mammalian cells in vitro, while Scl2 
molecules with MMP cleavage sites show 
enzyme cleavage at the native position. 
More collagen bioactive sites are being 
identified each year primarily through 
synthetic collagen mimetic Toolkit pep- 
tides (Farndale et al., 2008), providing an 
increasing pool of candidates for a recom- 
binant chimeric bacteria-human library 
with human collagen functions. Such a 
recombinant library is particularly useful 



for applying a synthetic biology "plug and 
play" concept to future biomaterial designs 
(Figure 1). Only those collagen biological 
functions that are required for a particu- 
lar application will be selected and incor- 
porated into a final product, leading to 
precise tuning for each application. 

MOLECULAR AND HIGHER ORDER 
STRUCTURE 

With recombinant DNA technologies, it 
has been possible to manipulate the Scl2 
protein, studying individual fragments 
of the triple-helix (Yu et al, 2011) or 
duplicating the entire triple-helix portion 
(Yoshizumi et al, 2009), while maintain- 
ing secondary structure and stability. Scl2 
is a homotrimer, serving as a model for 
homotrimeric collagens such as type II 
collagen in cartilage. To better mimic het- 
erotrimeric collagens such as the domi- 
nant type I collagen in bone and tendon 
or type IV collagen in basement mem- 
brane, it may be possible to replace the 
natural trimerization V domain with a 
heterotrimic coiled coil (Nautiyal et al., 
1995) or an electrostatic interaction chain 
selection mechanism (Jalan et al., 2013). 

While facile manipulation of the Scl2 
protein can optimize molecular and bio- 
logical properties, successful biomaterial 
applications will require optimization 
of material properties as well. The most 
abundant animal collagens self-associate 
into characteristic periodic cross-linked 
fibrils, which provide the mechanical 
properties for tissues and biomaterials. 
Thus far, Scl2 exhibits limited ability to 
form fibrillar structures (Yoshizumi et al., 
2009). The Scl2 triple-helix is only about 
1/5 of the length of the human fibrillar 
collagen triple-helix, and it is possible 
that increasing its length will promote 
self-association. The E. coli system usu- 
ally has an upper threshold for the size of 
the recombinant protein it can produce, 
but it may be possible to transfer the Scl2 
expression system into yeast or insect cells 
to further increase its chain length. Crystal 
structures on small collagen model pep- 
tides and studies on recombinant tobacco 
type I collagen suggest Hyp may be essen- 
tial for collagen fibril formation (Kramer 
et al, 2000; Perret et al., 2001). Thus, it 
may be necessary to introduce P4H genes 
into the bacterial system in order to induce 
fibril formation, even though the Hyp is 



not necessary to form the stable triple- 
helix. Alternative approaches to attaining 
an optimal self-supporting material from 
Scl2 include chemical modifications such 
as glutaraldehyde vapor or l-ethyl-3- 
(3-dimethylaminopropyl)carbodiimide 
or crosslinking by poly(ethylene glycol) 
diacrylate to form interchain networks 
(Cosgriff-Hernandez et al, 2010; Peng 
et al., 2010). Chemical modifications 
are simple and effective, but the level of 
crosslinking maybe difficult to control, 
especially in large 3D scaffolds, and a high 
degree of cross-linking may limit acces- 
sibility to biologically active sites within 
the collagen material. Another strategy 
recently reported involved non-covalent 
binding of Scl2 constructs with fibronectin 
and integrin interaction sites to solid silk 
protein material, generating porous silk 
scaffolds with improved support of cell 
growth (An et al, 2013) (Figure 1). 

PURIFICATION, SCALABILITY, AND 
PROJECTED COST 

Animal extracted collagens are produced 
in large quantity and are generally inex- 
pensive. However, difficulties in devel- 
oping standardized preparations of these 
collagens and in producing minor types 
of collagens that are free of collagen I 
and biological contaminants are major 
limitations. Potential infectious and aller- 
gic risks of animal collagen products are 
also a concern. The recombinant bac- 
terial collagens have attached tags, such 
as His-tag and Strep II tags to simplify 
standardized chromatographic purifica- 
tion. A high-throughput batch purifica- 
tion methodology for Scl2 has also been 
developed (Peng et al., 2014). Native triple 
helical sequences are resistant to digestion 
by non-specific enzymes such as trypsin 
and chymotrypsin, and trypsin treatment 
of acidified cell lysate resulted in purified 
triple helical bacterial collagen. Enzymatic 
digestion during the purification process 
ensures the final product will be free of 
non-triple helical contaminants, which is 
important for quality control for indus- 
trial production. Recombinant expression 
of proteins in E. coli is a mature indus- 
trial process with excellent scalability. This 
has already been demonstrated for Scl2 
production (Peng et al., 2012) with an 
average yield of 0.2-0.3 g/L of purified col- 
lagen protein in traditional shaking flask 
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FIGURE 1 | Schematic diagram of recombinant bacterial collagen production method and material fabrication strategies. hCol, human collagen; bCol, 
recombinant bacterial collagen Scl2. 



culture and up to 9.5 g/L in high density 
fed-batch culture. With the ease of gen- 
erating different Scl2 constructs through 
molecular cloning, the overall resource 
cost for producing this highly tunable 
bacterial collagen material will likely be 
lower than recombinant collagen obtained 
from mammalian cell or even transgenic 
systems. 

TOWARD IN VIVO APPLICATIONS 

Translational science remains a less 
explored area for recombinant collagens. 
Recombinant human collagens obtained 
from systems with high scalability, such 
as Pichia and Nicotiana, have been for- 
mulated into hydrogels and explored for 
their potential as artificial cornea (Merrett 
et al., 2008) and wound dressings (Shilo 
et al., 2013), respectively. In terms of bac- 
terial collagen, Scl2 has been shown to 
be non-immunogenic, non-cytotoxic and 
non-thrombogenic (Peng et al., 2010). 



Recently, poly( ethylene glycol) crosslinked 
bioactive Scl2 hydrogels have been rein- 
forced with an electrospun polyurethane 
mesh to achieve suitable biomechanical 
property for vascular grafts (Browning 
et al., 2012). The rate of cell migra- 
tion is tunable through altering protein 
concentration in the material. Further 
investigation, especially the degradation 
and turnover time of Scl2 material in vivo 
is needed to evaluate its suitability as 
medical implants. 

CRITICAL PROPERTIES FOR A 
SUCCESSFUL COLLAGEN PRODUCTION 
STRATEGY 

Figure 1 illustrates the current production 
method and material fabrication strate- 
gies for recombinant bacterial collagen 
Scl2. Large amounts of bioactive collagen 
molecules can be produced with simple 
gene manipulation, large scale produc- 
tion and high throughput purification. 



Functional collagen proteins could then be 
cross-linked into hydrogels or used as coat- 
ing on other solid materials. Conditions 
and sequence manipulations which would 
trigger Scl2 self-assembly into large fibers 
are currently under investigation. Such 
fabricated materials could lead to the ulti- 
mate goal of designing and developing 
artificial extracellular matrices, an objec- 
tive important for tissue engineering as 
well as biomedical fields. With its high 
tunability and scalability accompanied by 
low complexity and cost, we believe the 
Scl2 recombinant bacterial collagen sys- 
tem has clear advantages which could not 
only circumvent the difficulties seen for 
recombinant human collagens, but also 
open up a brand new pathway for collagen 
production. 

LONG TERM PROSPECTS 

Extracted bovine collagen is likely 
to remain a staple for biomaterial 
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construction due to its low cost and use- 
ful material properties, but challenges in 
standardization and concerns for biobur- 
dens, as well as a desire to modify collagen 
sequence and function, will lead to contin- 
ued interest in research and applications 
with recombinant collagens. As described 
here for the specific Scl2 protein from 
S pyogenes, bacterial collagens repre- 
sent a biosynthetic ground up approach, 
where a triple-helical non-animal collagen 
molecule with no specific bioactivity can 
be designed to include desired interac- 
tions and regulated degradation. Although 
research has focused on Scl2, the approach 
could be extended to other collagen-like 
proteins from bacteria and viruses, which 
may bring new properties and variations 
useful for biomaterials. Fundamental stud- 
ies on Scl2 and other bacterial collagens 
will continue to elucidate basic features 
of collagen, with a focus on chemistry, 
sequence, biological activity, and self- 
assembly. The success of the bacterial 
collagen approach toward biomaterial 
applications will depend on developing 
the capacity to generate biomaterials with 
desirable biomechanical properties useful 
for in vivo applications. 
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